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1 Introduction 

Quantitative studies of natural languages have led to significant advances 
in the understanding of word statistics [Tl[2] and language evolution 
A comparatively less explored (albeit extremely important) problem con- 
cerns the dynamics of word usage. Some representative examples include 
the study of bursts and lulls in word recurrence in online communities [S], 
distributions of n-grams in books written over the past 200 years [B], and 
analysis of word content in Twitter posts to assess temporal changes in per- 
ceived happiness |3- Language evolution and word statistics are related to 
word dynamics, as illustrated, for example, by early findings that word fre- 
quency itself is a correlate of word success at historical time scales jSJIH] ■ At 
shorter time scales, however, this relation is more subtle and remains far less 
understood. 

For time scales of just a few years, we have recently shown that word 
niche is a stronger determinant of future change in word frequency usage 
than the initial word frequency itself |10j . The niche of a word was defined 
in terms of the number of people and topics making use of the word and 
quantified by dissemination coefficients D^'\ These measures were applied 
to large records of Usenet groups spanning approximately two decades, in 
which people are represented by Usenet users and topics are represented 
by the discussion threads. In particular, the results in |10j show: (i) that 
the dissemination across users, D^{ti), and threads, D^{ti), at a time ti 
are both strongly positively correlated with the change in log-frequency 
Z\log/t2,ti = logio /(*2)-logio fih) for t2-ti of a few years; {ii) the changes 
in dissemination AD^!^ ^^ ^ {t2) - {h) and ADj^ j.^ = D'^ {t2) ~ D'^ {h) 
are both negatively correlated with A\og ft^^ti over the same time intervals. 

Here, we explore the relation between dissemination and frequency change 
using simple models for the population of word users. We interpret our results 
using data from two Usenet groups [TT]: the comp.os.linux.misc group, which 
is focused on Linux operating systems and has 28,903 users and 140,517 
threads for the period 1993-08-12 through 2008-03-31, and the rec. music. hip- 
hop group, which is focused on hip-hop music and has 37,779 users and 94,074 
threads for the period 1995-02-08 through 2008-03-31. In these datasets, each 
post represents a unit of text and is associated with a user and thread, while 
each thread itself is defined by the initial post and all replies. Examples 
of the variation of word frequency in these datasets are shown in Fig. [TJ 
Using our model and analysis of these datasets, we show that increase in 
frequency not accompanied by concurrent increase in the number of users is 
reflected as a decrease in and subsequent frequency fall. This, along with 
the observations (i) and (ii), illuminates the mechanistic difference between 
temporary and persistent frequency changes and helps explain why most 
frequency rises are just transient. We focus on modeling D'^, with the view 
that analogous results hold for . 

We also explore signatures of this behavior over longer time scales by con- 
sidering a digitized collection of over 2.4 million books published in English 
between 1820 and 2000 [12]. In this case, the dissemination is considered 
across different books, which captures characteristics of both word users and 
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topics. This dataset allows us to demonstrate that our observations are not 
unique to informal, Internet-based communications, and that they do in fact 
concern properties inherent to language change in general. 

We believe these results are timely as numerous studies are being carried 
out on statistical physics aspects of natural languages. Such studies have 
considered properties on scales ranging from individual letters [13] to thou- 
sands [13] or even millions [S] of words, and often benefit from concepts 
such as phase transitions |15[|16] and techniques such as network represen- 
tation [T7l[T8l[T9l[20] . In this context, increasing attention has been given to 
the modeling of language usage and language change (see, e.g., jSl l^Tll^l^ 
1241125] ). Our study of factors distinguishing persistent from temporary word 
frequency change contributes to this growing body of literature. 



2 Results 

2.1 Dissemination Coefficient 

We define the coefficient of dissemination of each word w across users as 

= -J^, (1) 

where is the number of occurrences of the word in the dataset, Uw is 
the number of users whose posts include word w at least once, and U is the 
expected number of users predicted by a baseline model in which words are 
randomized across users and threads. 

Specifically, the baseline is defined from U = ^f=iUi, where Nu is the 
total number of users and Ui is the probability that user i would use word w 
at least once if all words in the dataset are shuffled randomly while keeping 
fixed the sizes of the posts. The probability Ui can be calculated as the 
complement of the probability that the user never uses the word: Ui = 1 ~ 



j=0 



1 



Na-J 



where N,ui be the number of occurrences of the word w, 

rrii be the total number of words contributed by user i, and Na = ~ 
rrii is the total number of words in the dataset. In our datasets, rrii/NA ^ 
1 and /„, = N^/Na <C 1, which allows us to further simplify this expression 
to C/i w 1 — e^-^™™'. This represents a Poissonian baseline model in which 
the probability of using word w is given by the observed word frequency fyj. 
For the rest of the paper, we drop the index w for simplicity. 

Therefore, the expected value of is 1 for a word that is distributed 
randomly across all users. The main purpose of introducing this measure is 
to detect deviations from random. In particular, < 1 represents words 
that are clumped and hence used above average by a subset of all users. For 
example, the word "yep" shown in Fig. [Ijb) has varying between 0.36 
and 0.90 over different half-year windows. Clumping is in fact observed for 
most words in our datasets (89% of the words in the Linux group and 90% 
of the words in the hip- hop group). On the other hand, D'-^ > 1 represents 
words that are over disseminated and hence more evenly distributed across 
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users than expected by chance. Greetings and expressions of gratitude, such 
as thanks, tend to be in this class. We refer to [TU] for more information about 
the distribution of for the Usenet datasets we consider. 



2.2 Statistical Model 

We discuss a class of models that offer insights into how changes in / are 
related to changes in . This relation is key in discriminating between 
persistent and temporary word frequency growth. 

Assume that each user i and word w are characterized by two quantities: 
rrii e [0, oo), which is the size of the user's total contribution to the text in 
number of words, and € [0, 1], which is a fixed probability of using the 
word w as opposed to any different word. To simplify the calculations, we 
assume m to be a continuous variable. For each given word, a population 
of large size Nu is then described by the joint probability density function 
p{m, u) from which the relevant observable quantities can be calculated. In 
particular, within this model, the frequency of w is given by 

J _ ^ _ ir ^'^ lo miyp{m, v) 
dm mp,n{m) 

where pm{m) = ,Q dv p{m,v). Moreover, the expected fraction of users of 
word w is 

— = 1-/ dm. dv p{m,v)e-"''' , 
Jo Jo 

and the baseline is 

Jo "^"^ Jo '^'^ ~ ^) = ^- 

where the last term indicates the Laplace transform {Cg){y) = dx g[x)e''^^ 
It follows from the ratio between the previous two equations that the dis- 
semination IS given by 

_ 1 - dm jl dv p{m, iy)e-""' 

i^{cp„m ' 

where / is given by Eq. 

Therefore, given a probability distribution p(rn,v), Eq. ([3]) provides a 
quantitative relation between frequency and dissemination. As wc proceed 
to our analysis of pertinent implications, we note that the main assumption 
involved in this derivation is that users behave independently. That is, the 
size of their contributions as well as their individual word frequencies are 
independent of those of the other users. Nevertheless, this description is still 
quite general as it allows for an arbitrary relation between m and v. 
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2.3 Examples 

Example 1: Assume that with respect to a word w each user belongs to one 
of two distinct groups. In the first group, formed by a fraction < (? < 1 of 
the population, the users use the word with fixed frequency v = v* . In the 
second group, formed by the complementary fraction 1 — g of individuals, 
the users use the word with a negligible frequency (v ~ 0^). For simplicity 
we consider that all users contribute the same amount to the text, say m* 
words. Under these conditions, we have 

p{m, v) = 5{m - m*)[q5{v - v*) + (1 - q)5{v - 0+)]. (4) 

In this case, Eq. ^ results in the simple relation 

/ = (5) 

and Eq. (jSj leads to 

D'' =q ^ ' _ , (6) 

where the term m*u* corresponds to the average number of times each user 
uses the word w. 

Word usage changes over time not only in frequency but also in dissemi- 
nation. While the frequency in Eq. grows linearly with both v* and g, the 
dissemination coefficient in Eq. ^ increases with q but decreases with m*v* . 
To understand the significance of this, we examine the two different scenar- 
ios shown in Fig. [^a). In the first scenario, the frequency v* remains fixed 
but the fraction q of the population using the word changes over time; for 
increasing this represents a situation in which the overall frequency / in- 
creases because the word is used by more individuals. In the second scenario, 
the frequency v* changes, while the fraction q of users of the word remains 
fixed; for increasing v* , this corresponds to a case in which the frequency / 
rises simply because the word is used more repetitively by the same individ- 
uals. It is then clear that an increase in either v* or q leads to an increase in 
the overall frequency {Alogf > 0), but increase in j/* without a concurrent 
increase in q leads to a decrease in dissemination {AD^ < 0) even though 
the number of adopters of the word does not decrease. On the other hand, 
an increase in g, and hence in the number of actual users of the word, causes 
both frequency and dissemination to increase. Given that (ti) is strongly 
positively correlated with AXogft^^n [IHI, it is clear that the first scenario 
may lead to sustainable growth in frequency while the second may not. 

These conclusions do not depend sensitively on the assumption that the 
users contribute the same amount to the text. For example, replacing Eq. (|4]) 
with p(to, I') = pm{m)[q5{v — u*) + (1 — q)5{v — 0+)] leads to the same 
relation for the frequency and to a slightly less explicit expression for the 
dissemination, 

1 - dm pm{m)e~""'' 



1 ~ dm p™(TO)e-™'^*9 ' 



which is qualitatively similar to Eq. ([6|) if the distribution Pm{m) is peaked 
around a certain average m* . 
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Example 2: In the example above the variables m and v are assumed to be 
independent, i.e., p{m,v) = pm{m)py{v), meaning that the probability of 
using the word w is independent of the size of the contribution of the user. 
More generally, this case leads to 

f = [ dv vp^{v), (7) 







and 



We have previously observed that pmim) follows a log- normal distribution 
for the datasets considered here [TU]. In addition, by considering words of 
sufficiently high frequency to generate reliable statistics, we suggest that 
Pi/i'^)\i/>o too can be approximated by a log-normal distribution. Therefore, 
we consider the case in which In to is a normal distribution with average 
(In m) and standard deviation ai^ m for the whole population, and In v is 
a normal distribution with average (In v) and standard deviation ai^ v for a 
fraction q of the population. Here, q represents the fraction of users with v > Q 
and hence a non-negligible probability of using the word under consideration; 
the remaining fraction 1 — g of users do not use the word and are assigned 
V = 0+. 

Figure mb) shows a realization of this model for a choice of parameters 
representative of those in our datasets. Like in the case of the previous exam- 
ple, in the scenario in which the number of users is varied by controlling g, 
the resulting changes in the overall frequency are accompanied by concordant 
changes in dissemination (Z\log/ x AD^ > 0); conversely, the scenario in 
which the frequency is varied for a fixed number of users (now by controlling 
(Inz^)), the changes in the overall frequency are accompanied by opposing 
changes in dissemination (Z\log/ x AD^ < 0). As already mentioned, owing 
to the positive correlations between dissemination and subsequent frequency 
changes [10] , the first of these two scenarios will generally lead to more sus- 
tainable changes in frequency. This implication is demonstrated explicitly in 
the next section. 



2.4 Empirical Observations 

To test the behavior of words in real datasets, we performed additional anal- 
ysis in the Linux and hip-hop Usenet groups [TT]. Motivated by Fig. we 
focus on concurrent changes of both log / and . Specifically, we measured 
Z\ log /t2,ti and AD^^ for non-overlapping half-year windows centered at 
times ti and t2 ~ ti + At years. We consider all words in the intermediate 
frequency range 10^^ ^ / < 3 x lO""*, for which has been observed not 
to depend strongly on /. This independence facilitates analysis of the sepa- 
rate influence of frequency and dissemination on frequency change. In order 
to avoid floor effects on extremely low-frequency words and ceiling effects 
on extremely high-frequency ones, this was implemented by only selecting 
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words that appear more than 5 times in both windows and with a frequency 
no larger than 3 x 10^*^ in any window. 

In our analysis, words are strings composed only by the symbols "a — 
z,' , — " and are subjected to no additional lemmatization (we refer to |10j 
for the filtering of spams in our datasets). Taking all windows into account, 
32, 795 different unique words passed these criteria for the Linux group and 
27, 869 for the hip-hop group, corresponding to more than 40% of the whole 
text in each case; the whole text consists of 7.2 x 10^ and 5.3 x 10^ word 
occurrences, respectively. 

Figure [3] shows Alogf and AD'^ for h = 1998-01-01 and Z\< = 2 years. 
The distribution of words in each scatter plot is centered around the origin 
and spread over all quadrants. However, the distribution is clearly biassed 
towards the second and fourth quadrants. This is a manifestation of the 
negative correlations that dominate the relation between frequency change 
and dissemination change. The tendency of A log / and AD^ to vary in 
opposite directions is evident also from the running median in A log / as a 
function of AD^ (Fig- [21 continuous lines). In view of the properties of the 
statistical model in Fig. [21 this indicates that, for most words exhibiting a 
significant variation in overall frequency, the observed variation occurs due 
to a change in the usage rate among existing users of the word rather than 
a change in the number of individuals adopting the word. 

To verify the generality of these observations, we consider the values of 
the running median at AD^ = ±0.5 as a quantitative indicator of the general 
relation between Z\log/ and AD'^ . As shown in Fig. [4l^a, b), this indicator 
does not change substantially when we vary the position ti of the initial 
window. This implies that the conclusions drawn from Fig. [3] are in fact 
typical in our datasets for changes in frequency and dissemination over the 
time scale of a few years. Moreover, Fig.|4|c, d) shows that similar robustness 
is also observed when we vary this time scale, represented by the time At 
between the initial and final window. The values of the median of A log / 
at AD^ = ±0.5 increase slightly for large At, but this can be attributed 
in part to the criterion > 5, which has the effect of selecting against 
negative frequency changes and does so more strongly as the time between the 
windows is increased. The distance between these values is therefore a more 
informative measure than the values themselves, and this measure does not 
change substantially with At. In all cases, the median of A log / at AD^ = 
—0.5 is significantly larger than at AD'-^ = +0.5, confirming that large short- 
term variations in frequency and dissemination tend to oppose each other. 
Nevertheless, for given ti and At, a significant number of individual words 
do exhibit variations in frequency and dissemination that are concurrently 
increasing or decreasing, as illustrated in Fig. [31 

Finally, we demonstrate that frequency changes for which A log / x AD^ > 
arc indeed more persistent than those for which A log / x AD^ < 0. Fig- 
ure[5Ua, b) illustrates this point by showing for ti = 1998-01-01 how a change 
in log / acquired over At = 2 years sustains itself after 2 more years accord- 
ing to the quadrant the word belongs to in the representation of Fig. [31 The 
running medians (Fig. [5ja, b), dotted and continuous lines) indicate that 
the words belonging to the first quadrant {Alogf > 0, AD^ > 0) exhibit a 
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larger increase in frequency after At + 2 years than the words in the second 
quadrant (Z\log/ > 0, AD^ < 0). Likewise, although to a smaller extent, 
the words belonging to the third quadrant (Z\log/ < 0, AD^ < 0) tend 
to exhibit a larger final decrease in frequency than the words in the fourth 
quadrant (Z\log/ < 0, AD^ > 0). As shown in Fig. [5jc, d), for both the 
Linux and the hip-hop datasets, these systematic differences are statistically 
significant and continue to exist when ti is varied. 



2.5 Confirmation over Longer and Larger Scales 

We consider the Google Books Ngram Corpuses of English-language publica- 
tions over the period 1820-2000, which includes a total of 2,424,241 books [T^ . 
Starting with the raw data, we performed an initial cleaning to remove non- 
words. We focused on words formed by any combination of letters, apostro- 
phes, and internal hyphens, containing at least 3 letters and less than 50 
characters. Within this dataset, upper- and lower-case letters are treated as 
different words, but it can be argued that distinguishing case has little im- 
pact on our results. This leads to a dataset of 1.7 x 10"'^^ words. Within this 
set, we study the dissemination properties of words with average frequency in 
the interval 10^^ < ./ < 10"^, which results in 6.8 x 10^° words and 632,912 
unique words. In calculations of the dissemination coefficient, we further limit 
ourselves to words with a frequency of at least 10~^ within the corresponding 
year, which implies at least 10 occurrences of each selected word even for the 
years with the smallest number of books. 

We consider the dissemination across books, with the associated dissem- 
ination coefficient given by 

where the actual number of books using the word, B^,, and the expected 
number predicted by the baseline model B{Nw)^ are defined and calculated 
analogously to Uw and U{N^) in the user dissemination coefficient in Eq. ((T|). 
All calculations of the dissemination coefficient are performed over time 
windows of one year. Because no information is available in the database 
about the length of individual books, in estimating B{Nw) we have approx- 
imated the length of the books by their average length. We focus on books 
published no earlier than 1820 to avoid conflation of the now obsolete long 
"s" with "f" , which were not distinguished in the digitization process. Our 
choice of the period 1820-2000 is further motivated by the need to avoid years 
with extremely small and extremely large number of digitized books. 

Figurc[n]shows a summary of the empirical observations in this dataset. As 
in the case of the Usenet groups, the frequency change is negatively correlated 
with the dissemination change. This is illustrated both by considering a fixed 
At = 10 years for ti varying from 1820 to 1990 (Fig.lHlJa)) and by considering 
a fixed ti = 1820 for At varying from 10 to 180 years (Fig. El^b)). Over these 
long time scales, there are some systematic changes in Z\log/ both as a 
function of ti and as a function of At. But these changes may be partially 
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due to the heterogeneity of the dataset. For example, because recent years 
have a larger number of books (and hence of words) , the smaller A log / for 
AD^ = 0.5 for more recent t2 may be in part due to the fact that statistical 
fluctuations are less likely to push infrequent words below the frequency 
threshold 10 ~^ in recent years than in early years. More important, even 
when we consider frequency change over relatively long time intervals, the 
sign of the accompanying change in dissemination is a determinant factor for 
subsequent changes in frequency. This is illustrated in Fig.lB^c) for frequency 
changes over 20 years as determined by the frequency and dissemination 
changes over the first 10 years. These empirical observations corroborate the 
conclusion that word dissemination plays a central role in future rise and fall 
of word frequency even over long times and large social scales. 

3 Outlook 

Our demonstration that word frequency dynamics can be statistically re- 
lated to simple aspects of the users' dynamics opens new opportunities for 
the study of language dynamics in online communities. Several aspects of 
language dynamics have been traditionally addressed by tacitly assuming 
a homogeneous and essentially passive population of users. This includes, 
for example, the long-term lexical evolution and its dependency on word 
frequency [5]. This study, on the other hand, points to the importance of 
the medium in which the word is used, including its dynamics and hetero- 
geneities, which determine the niche of the word (lOj . Our results clearly 
show that short-term frequency changes in which the increase (decrease) in 
frequency is accompanied by a concurrent increase (decrease) in dissemina- 
tion are less dominant but far more persistent in a longer term. 

While we have focused mainly on the dissemination across users of a 
word, quantitatively described by the coefficient , similar results hold for 
dissemination across topics {D'^), which is another important aspect of the 
word niche. These different dimensions manifest themselves in the dissemi- 
nation across documents in formal writing, which is both topic and author 
dependent, as observed in our analysis of the dissemination coefficient 
for digitized books. In online discussion groups and other informal settings, 
because word usage reflects one's social identity, it is likely that the words 
actually used by people depend more strongly on their social network than 
on the words they know. Future research may thus provide further insight 
into word usage dynamics by accounting for the possible influence of the 
underlying social network dynamics, and point to new directions within the 
growing body of literature on cognitively and socially informed models of 
language pS] . 

Finally, we suggest that dissemination coefficients and the notion of niche 
itself can be extended to address factors contributing to success and failure 
in the spread of norms, propagation of information cascades, diffusion of 
innovation, and other processes that compete for adopters ^7\. There are 
processes, such as the dynamics of fashions and fads, in which an eventual 
widespread dissemination inhibits further adoption — a representative exam- 
ple being the selection of baby names [28] . But because the initial adoption 
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grows by imitation, even the rise of a fashion seems to depend criticaUy on 
the positive feedback of dissemination [5^. In these contexts, and in the 
dynamics of word usage too, another topic for future research concerns the 
impact of spatial patterns of dissemination (which is a major determinant in 
the survival of species and groups of species in ecological systems [30ll31ll32] ) 
and their interactions with other dissemination measures. 
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Fig. 1 Frequency dynamics for example words in the (a) Linux and (b) hip-hop 
groups. The frequency of a word is computed as the numtjer of occurrences of the 
word relative to the total number of words in a running window of half a year. 




Fig. 2 Frequency change and dissemination change for the statistical model, (a) 
Example 1: The changes Alog ft2,ti and ADY^^h are determined using Eqs. ^ and 
(IS} for m* = 100 words. Starting with q = 0.5 and v* = ul = 0.015 at time ti 
(corresponding to the origin in the diagram), two scenarios are considered at time 12'- 
1) V* = vl and < g < 1 (curve in top right and bottom left quadrants); 2) g = 0.5 
and < z^* < 1 (curve in top left and bottom right quadrants) . (b) Example 2: Same 
as in panel (a) but now using Eqs. ((Tjl and Q, for 5{v — v*) replaced by a log- 
normal distribution with crim/ = 0.8 and tunable (hiv) and for 5{m — m*) replaced 
by a log-normal distribution with crinm = 1.36 and (Inm) = 4.9. The first scenario 
is implemented using (kiv) = —4.9 and < q < 1, while the second is implemented 
using q = 0.5 and —10 < ^nv) < 0. Note that these scenarios represent respectively 
positive and negative correlations between frequency and dissemination changes. 
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Fig. 3 Frequency change versus dissemination change for the (a) Linux and (b) 
hip-hop groups. Both AD^^ ,ti and A log ft2 ,ti are calculated over half-year windows 
separated by two years, and centered on ti = 1998-01-01 and — 2000-01-01. The 
scatter plots include all words with Nu, > 5 in both windows, whereas the con- 
tinuous lines indicate the running medians and the dashed lines indicate the 5th 
and 95th running percentiles. Words with rising frequency appear above and words 
with falling frequency appear below Alog ft2,ti = 0. The higher concentration of 
points in the second and fourth quadrants indicate that frequency increase (de- 
crease) is for most words accompanied by dissemination decrease (increase), which 
corresponds to scenario 2 in Fig. [5] 
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Fig. 4 Pattern of frequency change as a function of time for the (a, c) Linux 
and (b, d) hip-hop groups, (a, b) Medians of the frequency change A\og ft2,ti as a 
function of the time ti for given ADY^^ti between —0.5 (solid squares) and 0.5 (solid 
circles); the windows are half-year wide and centered at ti, and t2 = ti + 2 years, 
(c, d) Medians of the frequency change A log ft2,ti as a function of the time interval 
At = t2—ti for given zADJ^ ^^ between —0.5 (solid squares) and 0.5 (solid circles); 
the windows are half-year wide and centered on ti — 1998-01-01, and t2 — ti + At 
years. In all panels, we consider all non-overlapping windows and the emphasized 
symbols correspond to the window pair in Fig. [S] The word selection is the same 
used in Fig. O 
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Fig. 5 Persistency of frequency change for the (a, c) Linux and (b, d) hip-hop 
groups, (a, b) Frequency change A\o%jtx-^2At,tx (after 2At years) versus frequency 
change Zi log /tj+At.ti (after At years) for t\ = 1998-01-01 and At — 2 years; all 
three windows are half-year wide. The dashed and continuous lines correspond to 
the running medians for points (shown in the background) with Alog fti+At,ti in 
the quadrants IQ, 3Q and 2Q, 4Q of Fig. O respectively, (c, d) Running medians 
as in (a, b) but now calculated using all points from all non-overlapping half-year 
windows for ti ranging from 1994-01-01 to 2004-01-01 for the Linux group and 
from 1995-07-01 to 2004-01-01 for the hip-hop group. The closed curves indicate 
the fraction of points along the corresponding directions from the origin. The word 
selection is the same used in Fig. [3] except that, in order to keep all eligible words 
of the first two windows, the condition > 5 is not imposed in the third window. 
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Fig. 6 Frequency change and dissemination change in the Google Books dataset. 
(a) Medians of the frequency change Alog ft2,ti as a function of the time ti for 
ADf^ i^ equal to —0.5 (solid squares) and 0.5 (solid circles); the windows are at 
ti and t2 = ti + 10 years, (b) Medians of the frequency change A\og ft2,ti as 
a function of the time interval At — t2 — ti for ADf^ ^^ equal to —0.5 (solid 
squares) and 0.5 (solid circles); the windows are in t\ = 1820 and t2 = t\ + At 
years, (c) Frequency change Alog ft^+2At,tx (after 2 At years) versus frequency 
change zl log /tj+zit.ti (after At years) for the aggregate collection of points cor- 
responding to t\ = 1820, 1830, 1980 and At = 10 years. Points corresponding 
to Alog fti+At,ti in the quadrants IQ, 3Q and 2Q, 4Q of the Alogf versus AD^ 
plot (not shown) are represented in red and black, respectively. Following this color 
code, the closed curves indicate the fraction of points along the corresponding di- 
rections from the origin and the dashed lines correspond to the running median for 
each quadrant. In all cases, the windows are one-year wide. 



